Topology and ambiguity 
in cj-context free languages 



Olivier Finkel 
Equipe de Logique Mathematique 
U.F.R. de Mathematiques, Universite Paris 7 
2 Place Jussieu 75251 Paris cedex 05, France. 
E Mail: finkel@logique.jussieu.fr 

Pierre Simonnet 
UMR CNRS 6134 
Faculte des Sciences, Universite de Corse 
Quartier Grossetti BP52 20250, Corte, France 
E Mail: simonnet@univ-corse.fr. 



Abstract 

We study the links between the topological complexity of an w-context free 
language and its degree of ambiguity. In particular, using known facts from 
classical descriptive set theory, we prove that non Borel w-context free lan- 
guages which are recognized by Biichi pushdown automata have a maximum 
degree of ambiguity. This result implies that degrees of ambiguity are really 
not preserved by the operation W — > , defined over finitary context free 
languages. We prove also that taking the adherence or the 5- limit of a finitary 
language preserves neither ambiguity nor inherent ambiguity. On the other 
side we show that methods used in the study of a;-context free languages can 
also be applied to study the notion of ambiguity in infinitary rational relations 
accepted by Biichi 2-tape automata and we get first results in that direction. 

Keywords: context free languages; infinite words; infinitary rational relations; ambiguity; 
degrees of ambiguity; topological properties; borel hierarchy; analytic sets. 

AMS Subject Classification: 68Q45; 03D05; 03D55; 03E15. 



1 



1 Introduction 



w-context free languages (w-CFL) form the class CFL^ of w-languages accepted by push- 
down automata with a Biichi or Muller acceptance condition. They were firstly studied 
by Cohen and Gold, Linna, Boasson, Nivat, |CG77j |Lin76j [BN80j |Niv77j . see Staiger's 
paper for a survey of these works |Sta97a| . A way to study the richness of the class C F 
is to consider the topological complexity of w-context free languages when the set S'^ of 
infinite words over the alphabet S is equipped with the usual Cantor topology. It is well 
known that all w-CFL as well as all w-languages accepted by Turing machines with a 
Biichi or a Muller acceptance condition are analytic sets. w-CFL accepted by determinis- 
tic Biichi pushdown automata are Hj-sets, while w-CFL accepted by deterministic Muller 
pushdown automata are boolean combinations of n2-sets. It was recently proved that 
the class CFL^ exhausts the finite ranks of the Borel hierarchy, [FinOlj . that there exists 
some w-CFL which are Borel sets of infinite rank, |Fin03b] . or even analytic but non Borel 
sets, |Fin03aj . 

Using known facts from Descriptive Set Theory, we prove here that non Borel w-CFL have 
a maximum degree of ambiguity: if L{A) is a non Borel u-CFL which is accepted by a 
Biichi pushdown automaton (BPDA) A then there exist 2^° w-words a such that A has 
2^" accepting runs reading a, where 2^° is the cardinal of the continuum. 

The above result of the second author led the first author to the investigation of the notion 
of ambiguity and of degrees of ambiguity in a;-context free languages, |Fin03c] . There exist 
some non ambiguous w-CFL of every finite Borel rank, but all known examples of w-CFL 
which are Borel sets of infinite rank are accepted by ambiguous BPDA. Thus one can 
make the hypothesis that there are some links between the topological complexity and the 
degree of ambiguity for w-CFL and such connections were firstly studied in [FinOScj . 
The operations W Adh{W) and W , where Adh(W) is the adherence of the 

finitary language IF C and is the (5-limit of W , appear in the characterization of 
n5 (i.e. closed)-subsets and Il^-subsets of T!^ , for an alphabet S, |Sta97a| . Moreover 
it turned out that the first one is useful in the study of topological properties of uj- 
context free languages of a given degree of ambiguity [FinOScj . We show that each of 
these operations preserves neither unambiguity nor inherent ambiguity from finitary to oj- 
context free languages. We deduce also from the above results that neither unambiguity 
nor inherent ambiguity is preserved by the operation W — > . This important operation 
is defined over finitary languages and is involved in the characterization of the class of oj- 
regular languages (respectively, of w-context free languages) as the w-Kleene closure of the 
class of regular (respectively, context free) languages |Tho90] |PP02j [Sta97a] [Sta97b| . 

On the other side we prove that the same theorems of classical descriptive set theory 
can also be applied in the case of infinitary rational relations accepted by 2-tape Biichi 
automata. The topological complexity of infinitary rational relations has been studied by 
the first author who showed in [Fin03d| that there exist some infinitary rational relations 
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which are not Borel. Moreover some undecidabihty properties have been estabhshed in 
|Fin03e] . We then prove some first results about ambiguity in infinitary rational relations. 

The paper is organized as follows. In section 2, we recall definitions and results about 
w-CFL and ambiguity. In section 3, Borel and analytic sets are defined. In section 4, we 
study links between topology and ambiguity in w-CFL. In section 5, we show some results 
about infinitary rational relations. 

2 cj- context free languages 

We assume the reader to be familiar with the theory of formal languages and of ui- 
regular languages, |Ber79j |Tho90j |Sta97aj |PP02j . We shall use usual notations of formal 
language theory. When S is a finite alphabet, a non-empty finite word over S is any 
sequence x = ai . . . ak, where a, G S for i = 1, . . . , A;, and k is an integer > 1. The length 
of X is k, denoted by . We write x{i) = ai and x[i] = x{l) . . . x{i) for i < k. We write 
also x[0] = A, where A is the empty word, which has no letter; its length is |A| = 0. S* is 
the set of finite words over S, and S"^ is the set of finite non-empty words over S. The 
mirror image of a finite word u will be denoted by u^. 

The first infinite ordinal is iv. An cj-word over S is an u; -sequence ai . . . a„ . . ., where 
Vi > 1 aj € S. The set of w-words over the alphabet S is denoted by S"^. An uj- 
language over an alphabet S is a subset of S"^. For V C S*, the w-power of V is the 
w-language = {cj = ui . . . ?i„ . . . e S"^ | Vi > 1 Uj G y - {A}}. LF{v) is the set of 
finite prefixes (or left factors) of the word v, and LF{V) = U„gv'LF(t;) for every language 
V of finite or infinite words. 

We introduce now w-context free languages via Biichi pushdown automata. 

Definition 2.1 A Biichi pushdown automaton is a 7-tuple A = {K,T,,T ,5,qo, Zq, F), 
where K is a finite set of states, S is a finite input alphabet, T is a finite pushdown 
alphabet, qo (z K is the initial state, Zq € F is the start symbol, F <^ K is the set of final 
states, and 5 is a mapping from K x (T,L) {A}) x F to finite subsets of K x T* . 
If ^ ^ F"*" describes the pushdown store content, the leftmost symbol will be assumed to be 
on "top" of the store. A configuration of the BPDA A is a pair {q,j) where q £ K and 
7 G F*. 

For a € SU{A}, 7, /3 E F* and Z if (p, 13) is in 6{q, a, Z), then we write a : {q, Z7) 1— 5-_4 
(p,/37)- 

Let a = a\a2 . . . a„ . . . be an co-word over S. A run of A on a is an infinite sequence 
r = 7i, where {qi,Ji)i>i is an infinite sequence of configurations of A and, for 

all i > 1, G {0, 1} and: 

1- (9i>7i) = (90,^0) 
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2. for each i > 1, there exists 6^ G S U {A} satisfying 
and ( Ei = iff bi = X ) 

and such that aia2 . . .an ■ ■ ■ = ^162 ■ ■ - bn ■ ■ ■ 

In{r) is the set of all states entered infinitely often during run r. 
The uj -language accepted by A is 

L[A) = {(T G Ti^ I there exists a run r of A on a such that In{r) n F 7^ 0} 

The class CFL^ of w-context free languages is the class of w-languages accepted by 
Biichi pushdown automata. It is also the cj-Kleene closure of the class CFL of context 
free finitary languages, where for any family C of finitary languages, the w-Kleene closure 
of £, is: uj - KC{C) = {Uf^-^Ui.Vi^ \ \fi E [1, n] Ui, Vi € £}. 

If we omit the pushdown stack and the A-transitions, we get the classical notion of Biichi 
automaton. Recall that the class REG^ of w-regular languages is the class of w-languages 
accepted by finite automata with a Biichi acceptance condition. It is also the w-Kleene 
closure of the class REG of regular finitary languages, |Tho90] |Sta97a] |PP02| . 

Notice that we introduced in the above definition the numbers £i S {0, 1} in order to 
distinguish runs of a BPDA which go through the same infinite sequence of configurations 
but for which A-transitions do not occur at the same steps of the computations. We can 
now briefly recall some definitions of |Fin03cj about ambiguity. 

We shall denote the cardinal of oj, and 2*^'' the cardinal of the continuum. It is also the 
cardinal of the set of real numbers and of the set T!^ for every finite alphabet S having at 
least two letters. 

Definition 2.2 Let A he a BPDA accepting infinite words over the alphabet S. For 
X € Tj^ let Oij({x) he the cardinal of the set of accepting runs of A on x. 

Lemma 2.3 ([FinOSc]) Let A be a BPDA accepting infinite words over the alphabet S. 
Then for all x it holds that a^(x) G N U {Hq, 2^f'}. 

Definition 2.4 Let A be a BPDA accepting infinite words over the alphabet E. 

(a) //sup{a^(x) I x G Tl^} G NU {2^"}, then a a = sup{a^(x) | x G S"^}. 

(b) //sup{a^(x) I X G S"^} = and there is no word x G S"^ such that a^(x) = 'Aq, 
then a A = ^0 • 

fb^Q does not represent a cardinal but is a new symbol that we introduce to conve- 
niently speak of this situation). 

(c) //sup{a^(a;) | x G S'^} = and there exists (at least) one word x G S'^ such that 
a^(x) = Hq; then oa = ^0 
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Notice that for a BPDA A, aj^ = iff ^ does not accept any u-word. 

N U {Nq , Ho, 2*^0} is linearly ordered by the relation < defined by VA: G N, A; < A; + 1 < 

^0 < ^0 < 2^". Now we can define a hierarchy of w-CFL: 

Definition 2.5 For A; G N U {Kg", ^^o, 2^"} let 
CFL^ia <k) = {L{A) \ A is a BPDA with < k} 
CFL^{a < k) = {L{A) \ A is a BPDA with a a < k} 

NA — CFL^ = CFL^[a < 1) is the class of non ambiguous co-context free languages. 
For every integer k such that k > 2, or k £ {^q, ^Oi 2*^*'}, 
A{k) - CFL^ = CFL^ia < k) - CFL^{a < k) 

If L £ A{k)-CFL^ with k eN,k >2, or k e {Hg , Nq, 2^"}^ fj^^^ L is said to be inherently 
ambiguous of degree k. 

Recall that one can define in a similar way the degree of ambiguity of a finitary context 
free language. If M is a pushdown automaton accepting finite words by final states (or 
by final states and topmost stack letter) then om £ N or oa/ = or om = ^o- However 
every context free language is accepted by a pushdown automaton M with um < > 
|ABB96] . We shall denote, with similar notations as above, the class of non ambiguous 
context free languages by NA — CFL and the class of inherently ambiguous context free 
languages of degree /c > 2 by A{k) — CFL. Then ^(Hg ) — CFL is usually called the class 
of context free languages which are inherently ambiguous of infinite degree, |Her97| . 

Now we can state some links between cases of finite and infinite words. 

Proposition 2.6 ([F inOSc] ) Let V C.Ti* be a finitary context free language and d be a 
new letter not in Ti, then the following equivalence holds for a// A: G N U {^g }•' 

V.d"^ is in CFL^{a < k) iffV is in CFL{a < k) 

3 Borel and analytic sets 

We assume the reader to be familiar with basic notions of topology which may be found 
in [MosSnj [LT94] [Kec95j |Sta97aj [PP02]. 

For a finite alphabet X we shall consider X'^ as a topological space with the Cantor 
topology. The open sets of X'^ are the sets in the form W.X'^ , where W C X*. A set 
L C X^ is a closed set iff its complement X^ — L is an open set. 
Define now the hierarchy of Borel sets of finite ranks: 

Definition 3.1 The classes SjJ and IlJJ of the Borel hierarchy on the topological space 

X^ are defined as follows: 

1^5 the class of open sets of X^ . 

is the class of closed sets of X^ . 
And for any integer n>\: 

^n+i class of countable unions of Yl^-subsets of X'^ . 

nn+i is the class of countable intersections of Yl^- sub sets of X'^ . 
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The Borel hierarchy is also defined for transfinite levels, but we shall not need them in 
the present study. The class of Borel subsets of is the closure of the class of open 
subsets of X'^ under complementation and countable unions (hence also under countable 
intersections) There are also some subsets of X'^ which are not Borel. In particular the 
class of Borel subsets of X^ is strictly included into the class Sj; of analytic sets which 
are obtained by projection of Borel sets. 

Notice that if S and F are two finite alphabets then the product S'^ x can be identified 
with the space (S x T)"^ and we always consider in the sequel that such a space T!^ x 
is equipped with the Cantor topology. 

Definition 3.2 A set A C S"^ is an analytic set if there is a finite alphabet T and a Borel 
set 5 C X such that A = {a G S'^ | 3/3 G T'^ (a, (3) G B}. 

A set C C T,^ is coanalytic if its complement S"^ — C is analytic. The class of analytic 
sets is denoted Sj; and the class of coanalytic sets is denoted 

Recall also the notion of completeness with regard to reduction by continuous functions. 
For an integer n > 1, a set F C X"^ is said to be a 5]° (respectively, 11°, H])- 
complete set iff for any set E C Y'^ (with Y a finite alphabet): E G (respectively, 
G n°, Eg S^, E gUI) iff there exists a continuous function f : Y"^ ^ X"^ such that 
E = f-\F). 

(respectively, n°)-complete sets, with n an integer > 1, are thoroughly characterized 
in |Sta86j . 

4 Topology and ambiguity in cc;-context free lan- 
guages 

Let S and X be two finite alphabets. If i? C T,^ x X'^ and a G T,^ , the section in 
a of -B is Ba = {/3 G X'^ \ (a, (3) G B} and the projection of B on S"^ is the set 
PROJj:^{B) = {a g S"^ I 5a / 0} = {a e I 3/3 (a,/3) G B}. 

We are going to prove the following lemma which will be useful in the sequel: 

Lemma 4.1 Let S and X be two finite alphabets having at least two letters and B be a 
Borel subset of T,^ x X^ such that PROJsui(B) is not a Borel subset of T,^ . Then there 
are 2^" uo-words a G S"^ such that the section Ba has cardinality 2^°. 

Proof. Let S and X be two finite alphabets having at least two letters and i? be a Borel 
subset of S'^ X X'^ such that PROJt.'-{B) is not Borel. 

In a first step we shall prove that there are uncountably many a G T!^ such that the 
section Bn is uncountable. 
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Recall that by a Theorem of Lusin and Novikov, see |Kec95l page 123], if for all a £ S"^, 
the section of the Borel set B was countable, then PROJy.^{B) would be a Borel 
subset of S'^. 

Thus there exists at least one a € S"^ such that B^ is uncountable. In fact we have not 
only one a such that B^ is uncountable. 

For a G S'^ we have {a} x Ba = B n [{a} x X^]. But {a} x is a closed hence Borel 
subset of T,^ X X'^ thus {a} x B^ is Borel as intersection of two Borel sets. 

If there was only one a S S'^ such that Ba is uncountable, then C = {a} x Ba would be 
Borel so D = B — C would be borel because the class of Borel sets is closed under boolean 
operations. 

But all sections of D would be countable thus PROJy,^{D) would be Borel by Lusin and 
Novikov's Theorem. Then PROJy.^{B) = {a} U PROJt.^{D) would be also Borel as 
union of two Borel sets, and this would lead to a contradiction. 

In a similar manner we can prove that the set C/ = {a € T!^ \ Ba is uncountable } is 
uncountable, otherwise U = {aoj ai, • • • . . .} would be Borel as the countable union of 
the closed sets {ai}, i > 0. 

For each n > the set {an} x Ba„ would be Borel, and C = Urig^{a„} x Ba^ would be 
Borel as a countable union of Borel sets. So D = B — C would be borel too. 
But all sections of D would be countable thus PROJy,^{D) would be Borel by Lusin and 
Novikov's Theorem. Then PROJy.^{B) = U L) PROJj^^{D) would be also Borel as union 
of two Borel sets, and this would lead to a contradiction. 

So we have proved that the set {a € | B^ is uncountable } is uncountable. 

On the other hand we know from another Theorem of Descriptive Set Theory that the set 
{a € S'^ I Ba is countable } is a n];-subset of T,^, see |Kec951 page 123]. 
Thus its complement {a £ T,^ \ Ba is uncountable } is analytic. But by Suslin's Theorem 
an analytic subset of S"^ is either countable or has cardinality 2*^" , |Kec951 p. 88] . Therefore 
the set {a € S"^ | Ba is uncountable } has cardinality 2^". 

Recall now that we have already seen that, for each a £ S"^, the set {a} x Ba is Borel. We 
can then infer that Ba itself is Borel by considering the function h : X'^ S"^ x X'^ defined 
by h{a) = (a, a) for all a £ X^. The function h is continuous and Ba = h~^{{a} x Ba)- 
So Ba is Borel because the inverse image of a Borel set by a continuous function is a Borel 
set. Again by Suslin's Theorem Ba is either countable or has cardinality 2^". From this 
we deduce that {a £ S"^ | Ba is uncountable } = {a £ S'^ | Ba has cardinality 2^°} has 
cardinality 2^" . □ 

We can now infer some results for w-context free languages. 
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Theorem 4.2 Let L{A) be an lo-CFL accepted by a BPDA A such that L{A) is an an- 
alytic but non Borel set. The set of u-words, which have 2^" accepting runs by A, has 
cardinality 2^'^. 

Proof. Let A = (K, S, F, 6, go, Zq, F) be a BPDA such that L{A) is an analytic but non 
Borel set. 

To an infinite sequence r = {qi,^i,£i)i>i, where for all i > 1, € i^, 7i € r+ and 
Ei G {0, 1}, we associate an cj-word f over the alphabet X = F U U {0, 1} defined by 

f = gi.7i.ei.g2-72-e2 • • • qi-li-£i ■ ■ ■ 

Then to an infinite word a € S"^ and an infinite sequence r = {qi,^i,£i)i>i, we associate 
the couple {a, f)€^'^x{TUKU {0, 1})'^. 

Recall now that n2-subsets of a Cantor set T,^ are characterized in the following way. For 
W ^Ti* the (5-limit of W is the set of w-words over E having infinitely many prefixes 
in W: = {a eY.'^ \ 3'^i such that a{l) . . . a{i) G W}. Then a subset L of E"^ is a 

n^-subset of S"^ iff there exists a set C S'^ such that L = W\ [Sta97a| |PP02] . 

It is then easy to see that the set 

i? = {((J, f) I f is the code of an accepting run of A over a} 

is a n2-subset of S"^ x X'^ = (S x X)^ as intersection of two n2-sets. In fact we have 
R = {R'Y n {R"Y where i?' C (S x X)+ is the set of couples of words {u, v) in the form: 

u = ai.a2 - ■ ■ ■ ap 

V = ft. 71. £1.^2 -72 -£2 • • • qn-ln-Sn 

where for each i G [l,p] ai G S, for each i G [l,n] qi G -ftT, 7i G F"^ and £i G {0,1}. 
Moreover \u\ = e„ = 1, and 

1- (9i,7i) = 

2. for each z G [l,n — 1], there exists 6j G S U {A} satisfying 

h : (ft,7i) (9i+i,7i+i) 
and ( = iff 6j = A ) 

and such that bih2 ■ ■ ■ bn-i is a prefix of u = 01.02- . . . ap. 

And R" Q (T, X X)^ is the set of couples of words {u,v) G T,^ x X~^ such that |u| = \v\ 
and the last letter of v is an element q & F. 

In particular i? is a Borel subset of T,'^ x X"^. But by definition of R it turns out that 
PROJj:^{R) = L{A) so PROJj:^{R) is not Borel. Thus Lemma O implies that there 
are 2^° w-words a G S"^ such that Ra has cardinality 2^°. This means that these words 
have 2^0 accepting runs by the Biichi pushdown automaton A. □ 
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Example 4.3 Let S = {0, 1} and d be a new letter not in S and 

D = {u.d.v \u,v e'S* and {\v\ = 2\u\) or {\v\ = 2\u\ + 1) } 

D C (S U {d})* is a context free language. Let g : ^ ^((^^ U {d})*) be the substitution 
defined by g{a) = a.D. As W = 0*1 is regular, g{W) is a context free language, thus 
{g{W))^ is an uj-CFL. It is proved in IFinOSa^ that {g{W))'^ is Ti\-complete. In particular 
{giW))^ is an analytic non Borel set. Thus every BPDA accepting {g{W)Y has the 
maximum ambiguity and {g{W))'^ € ^(2*^°) — CFL^. 

On the other hand we can prove that g{W) is a non ambiguous context free language. 

For that purpose consider a (finite) word x G g{W); then x G g{G^.l) for some integer 
n > 0. Therefore x may be written in the form 

X = {).Ui.d.Vi.d.U2.d.V2 ■ ■ ■ O.Un-d.Vn.l.Un+l-d.Vn+l 

where Ui.d.Vi € D holds for all i (z [1, n + 1] . It is easy to see that the length \vn+i \ and the 
word Vn+i are determined by the word x: Vn+i is the suffix of x following the last letter 
d of X, and \vn+i\ = 2|'u„_(_i| (if \vn+i\ is even) or \vn+i\ = 2|?i„_|_i| + 1 (if |fn+i| is odd) 
thus \un+i\ is determined by \vn+i\ hence Un+i is also determined. Next one can see that 
Vn also is fixed by x (the word Vn-^-Un+i is the segment of x which is located between the 
n^^ and the (n + 1)*'^ occurrences of the letter d in x and knowing Un+i gives us Vn). 
We can similarly prove by induction on the integer k that the words i^n+i-fc <ind Un+i-k, 
for k G [0,n], are uniquely determined by x. 

Therefore the word x admits a unique decomposition in the above form. We can then easily 
construct a pushdown automaton (and even a one counter automaton) which accepts the 
language g{W) and which is non ambiguous. So the language g{W) is a non ambiguous 
context free language. 

The above example shows that the u-power of a non ambiguous context free language may 
have maximum ambiguity. Conversely consider the context free language y = Vi U V2 ^ 
{a,b,c}* where Vi = {a"'b"'cP \ n > 1, p > 1} and V2 = {a^bPc^ \ n > 1, p > \}. Vi and 
V2 are deterministic context free, hence they are non ambiguous context free languages. 
But their union V is an inherently ambiguous context free language |Mau69j . V* is a 
context free language which is inherently ambiguous of infinite degree (and it is proved in 
[^j98| that it is even exponentially ambiguous in the sense of Naji and Wich, see also 
[Wic99j about this notion). Let then L = V* U {o, b, c}. The language L is still a context 
free language which is inherently ambiguous of infinite degree and L'^ = {a, 6, c}"^ is an 
w-regular language hence it is a non ambiguous w-context free language. 

We have then proved that neither unambiguity nor inherent ambiguity is preserved by the 
operation L — > L^: 



9 



Proposition 4.4 

1. There exists a non ambiguous context free finitary language L such that is in 
^(2«o) _ CFL^. 

2. There exists a context free finitary language L, which is inherently ambiguous of 
infinite degree, such that is a non ambiguous oj-context free language. 

We can also consider the above mentioned language g{W) in the context of code theory. We 
have proved that g{W) is a non ambiguous context free language. By a similar reasoning 
we can prove that g(W) is a code, i.e. that every (finite) word y G g{W)~^ has a unique 
decomposition y = xi.X2 ■ ■ ■ Xn in words Xi € g{W). 

On the other side g{W) is not an cj-code, i.e. some words z G g{W)^ have several 
decompositions in the form z = X1.X2 ■ ■ ■ Xn ■ ■ ■ where for all i > 1 Xj G g(W). In fact we 
can get a much stronger result, using Lemma l4.ll 

Fact 4.5 There are 2^*' uj-words in g{W)^ which have 2^" decompositions in words in 

g{w). 

Proof. We can fix a recursive enumeration 6 of the set giW). So the function : N ^ 
g{W) is a bijection and we denote Ui = 6{i). 

Let now V be the set of couples (a, x) G {0, l}"^ x (S U {d})'^ such that: 
1. o" G (0*.l)'^, so a may be written in the form 

a = 0"M.0"2.1.0"M...0"M.0"p+i.l... 
where Vi > 1 > 0, and 

2. 

^ — •Un2 ■^n:j • • • Ufip -Uup^i ■ ■ ■ 

T> is a Borel subset of {0, 1}"^ x (S U {d})^ because it is accepted by a determinis- 
tic Turing machine with a Biichi acceptance condition [Sta97a] . On the other hand 
Pi?0 J(su{(i})" (^) = 9{W)'^ is not Borel and Lemma 14.11 implies that there are 2^" u)- 
words X in g{W)'^ such that Vx has cardinality 2*^". This means that there are 2^° 
w-words X G g{W)'^ which have 2^° decompositions in words in g{W). 
We can say that the code g{W) is really not an w-code ! □ 

The result given by Theorem 14.21 may be compared with a general study of topological 
properties of transition systems due to Arnold [Arn83a] . If we consider a BPDA as a tran- 
sition system with infinitely many states, Arnold's results imply that every non ambiguous 
w-CFL is a Borel set. On the other side deterministic w-CFL have not a great topological 
complexity, because they are boolean combinations of n2-sets. We know some examples 
of non ambiguous w-CFL of every finite Borel rank, but none of infinite Borel rank. These 
results led the first author to the following question: are there some more links between 
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the topological complexity of an w-CFL and the ambiguity of BPDA which accept it? In 
|Fin03cj the well known notions of degrees of ambiguity for CFL are extended to w-CFL 
and such supposed connections are investigated. In particular, using results of Duparc 
on the Wadge hierarchy, which is a great refinement of the Borel hierarchy |Dup01| , it is 
proved that for each k such that k is an integer > 2 or A; = i^Q and for each integer n > 1, 
there exist in A[k) — CFL,^ some SjJ-complete w-CFL and some n°-complete w-CFL. 
In the proofs of these results is used the operation W AdhiW) where for a finitary 
language W C E*, Adh{W) = {a ^T.^ \ LF{a) C LF{W)} is the adherence of W. We 
recall that a set L C S"^ is a closed set of S'^ iff there exists a finitary language 1^ C S* 
such that L = Adh{W). 

It is well known that if is a context free language, then Adh{W) is in CFL^. Moreover 
every closed (deterministic ) w-CFL is the adherence of a (deterministic ) context free 
language, |Sta97a| . 

So the question of the preservation of ambiguity by the operation W — > Adh{W) naturally 
arises. 

Proposition 4.6 Neither unambiguity nor inherent ambiguity is preserved by taking the 
adherence of a finitary context free language. 

Proof. (I) We are firstly looking for a non ambiguous finitary context free language which 
have an inherently ambiguous adherence. Let then the following finitary language over 
the alphabet {a, b, c, d}: 

Li = {a'^b^d'.d^^ j n,p,i are integers > 1} U {a^b^c^.d^*"^^ | n,p,i are integers > 1} 

Li is the disjoint union of two deterministic (hence non ambiguous) finitary context free 
languages thus it is a non ambiguous CFL because the class NA — CFL is closed under 
finite disjoint union. It is easy to see that the adherence of Li is 

Adh{Li) = {a'^} [j a+.b'^ y {a"6" | n > Ij.c"^ |J(Vi U V2).d'^ 

where Vi = {a'^b^'cP | n > 1, p > 1} and ^2 = {a'^bPcP | n > 1, p > 1}. Then it holds that 
Adh{Li) n a+.6+.c+.d^ = {Vi U V2).d'^ = V.d^, where ^ = U V2. 

By proposition 1 2. 6 1 the w-context free language V.d^ is inherently ambiguous because V is 
inherently ambiguous |Mau69j . Thus Adh{Li) is inherently ambiguous because otherwise 
V.d^ would be non ambiguous because the class NA — CFL^^ is closed under intersection 
with w-regular languages |Fin03cj . and .b'^ .c'^ .d^ is an w-regular language. 

(II) We are now looking for an inherently ambiguous context free language which have 
a non ambiguous adherence. We shall use a result of Crestin, |Cre72j : the language 
C = {u.v \ u,v ^ {a, b}^ and = u and = v} is a context free language which is 
inherently ambiguous (of infinite degree). In fact C = where Lp = {v G {a, b}^ \ = 
v} is the language of palindromes over the alphabet {a,b}. Consider now the adherence 
of the language C. Adh{C) = {a, b}^ holds because every word u G {a, 6}* is a prefix of a 
palindrome (for example of the palindrome u.u^) hence it is also a prefix of a word of C. 
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Thus C is inherently ambiguous and Adh{C) is a non ambiguous w-context free language 
because it is an w-regular language. □ 



We have seen that closed sets are characterized as adherences of finitary languages. Sim- 
ilarly we have already seen, in the proof of Theorem 14.21 that n2-subsets of Tj^ are 
characterized as 5-limits of finitary languages Vl^ C S*. 

Recah that W G REG imphes that G REG^. But there exist some context free lan- 
guages L such that is not in CFL^; see [Sta97a] for an example of such a language L. 
In the case W G CFL and G CFLi_j, the question naturally arises of the preservation 
of ambiguity by the operation W — > . The answer is given by the following: 

Proposition 4.7 Neither unambiguity nor inherent ambiguity is preserved by taking the 
5-limit of a finitary context free language. 

Proof. (I) Let again Li be the following finitary language over the alphabet {a,6, c, d}: 

Li = {a"6"cP.d^* \n,p,i are integers > 1} U {a'^lPcP.d^'-^^ \n,p,i are integers > 1} 

Li is a non ambiguous CFL. And the (5-limit of the language Li is (Li)^ = {Vi U V2)-d'^ = 
V.d'^ . We have already seen that this w-language is an inherently ambiguous ui-CFh. 

(II) Consider now the inherently ambiguous context free language V = {a"'6"c^ \ ri,p > 
1} U {a"6PcP \n,p> 1}. Its (5-limit is = {a". 6" | n > Ij.c'^. It is easy to see that is 
a deterministic w-CFL hence it is a non ambiguous a;-CFL. □ 

5 Topology and ambiguity in infinitary rational 
relations 

Infinitary rational relations are subsets of S"^ x F"^, where S and F are finite alphabets, 
which are accepted by 2-tape Biichi automata. 

We are going to see in this section that some above methods can also be used in the case 
of infinitary rational relations. 

Definition 5.1 A 2-tape Biichi automaton (2-BA) is a sextuple T = {K,T,,V,lS.,qQ,F), 
where K is a finite set of states, S and F are finite alphabets, A is a finite subset of 
K X T,* X r* X K called the set of transitions, qo is the initial state, and F C. K is the set 
of accepting states. 

A computation C of the 2-tape Biichi automaton T is an infinite sequence of transitions 

(qo,ui,vi,qi), {qi,U2,V2,q2), ■ ■ ■ {qi-^i,Ui,Vi,qi), (g^, Uj+i, u^+i, Qj+i), . . . 

The computation is said to be successful iff there exists a final state qj ^ F and infinitely 
many integers i > such that qi = qf. 
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The input word of the computation is u = ui.U2-U3 . . . 
The output word of the computation is v = vi.V2-Vs . . . 
Then the input and the output words may be finite or infinite. 

The infinitary rational relation R{T) C T,^ x F"^ accepted by the 2-tape Biichi automaton 
T is the set of couples (n, v) € T,^ x such that u and v are the input and the output 
words of some successful computation C of T . 
The set of infinitary rational relations will be denoted RAT^. 

One can define degrees of ambiguity for 2-tape Biichi automata and for infinitary rational 
relations as in the case of BPDA and a;-CFL. 

Definition 5.2 Let T be a 2-BA accepting couples of infinite words of Ti^ x . For 
{u,v) € S"^ X r*^, let ar{u,v) be the cardinal of the set of accepting computations ofT on 
{u,v). 

Lemma 5.3 Let T be a 2-BA accepting couples of infinite words {u,v) £ T,^ x T"^. Then 
for all {u,v) G E'^ X r'^ it holds that ariu.v) G NU {Ho,2^o}. 

The proof that a value between and 2*^° is impossible follows from Suslin's Theorem 
because one can obtain the set of codes of accepting computations of T on (u, v) as a 
section of a Borel set (see proof of next theorem) hence as a Borel set. A similar reasoning 
was used in the proof of Lemma 12.31 |Fin03c] . 

Definition 5.4 Let T be a 2-BA accepting couples of infinite words {u,v) € S'^ x . 

(a) If sup{ariu,v) \ {u,v) G S"^ x F"^} G NU{2*~'o}, then ar = sup{ariu,v) \ {u,v) G 
S'^ X T'^}. 

(b) If sw£>{ar{u,v) \ {u,v) G H"^ x r"^} = and there is no {u,v) G S'^ x L"^ such that 
ar{u,v) = i^o, then ar = ^q- 

(c) If sxi'p{aT{u,v) I {u,v) G S"^ x L^} = i^o and there exists (at least) one couple 
{u,v) G S"^ X such that aq-iujv) = Nq; then aj- = 

The set N U {^^g , ^o, 2^"} is linearly ordered as above by the relation <. 

Definition 5.5 For /c G N U {Kg , ^^o, 2^"}, let 
RAT^{a <k) = {R{T) \T is a2- BA with ar < k} 
RAT^{a <k) = {rIt) \T is a2- BA with ar < k} 

NA — RATi^ = RATi^(a < 1) is the class of non ambiguous infinitary rational relations. 
For every integer k >2, or k £ {^o ' ^O) 2*^"}, 

A{k) — RAT^ = RAT^{a < k) — RAT^{a < k) is the class of infinitary rational relations 
which are inherently ambiguous of degree k. 

As for cj-context free languages, one can use Lemma l4.1l to prove the following result. 
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Theorem 5.6 Let R{T) C S"^ x he an infinitary rational relation accepted by a 2-tape 
Biichi automaton T such that R{T) is an analytic but nan Borel set. The set of couples 
of u- words, which have 2^" accepting computations by T , has cardinality 2^*'. 

Proof. It is very similar to proof of Theorem 14.21 Let R{T) C S'^ x be an infinitary 
rational relation accepted by a 2-tape Biichi automaton T = (A', S, F, A, go, We 
assume also that R{T) is an analytic but non Borel set. To an infinite sequence 

C = {qo,ui,vi,qi), {qi,U2,V2,q2), ■ ■ ■ {qi-i,Ui,Vi, qi), (g^, Uj+i, Vj+i, gj+i), . . . 

where for all z > 0, (/j G A', for alH > 1, € and Vi € T*, we associate an a;- word C 
over the alphabet X = K UY^UT L) {e}, where e is an additional letter. C is defined by: 

C = qo.ui.e.vi.qi.U2.e.V2.q2 ■ ■ ■ qi-Ui+i.e.Vi+i.qi+i . . . 

Then the set 

{(u, V, C) € S"^ X r'^ X X'^ I C is the code of an accepting computation of T over (u, v)} 

is accepted by a deterministic Turing machine with a Biichi acceptance condition thus it 
is a n2-set. We can conclude as in proof of Theorem 14.21 □ 

The first author showed that there exist some I]];-complete, hence non Borel, infinitary 
rational relations |Fin03dj . So we can deduce the following result. 

Corollary 5.7 There exist some infinitary rational relations which are inherently am- 
biguous of degree 2^" . 

Remark 5.8 Looking carefully at the example of non Borel infinitary rational relation 
given in ]Fin03d!^ . we can find a rational relation S over finite words such that S is non 
ambiguous and is non Borel. So S is a finitary rational relation which is non ambiguous 
but S'^ has maximum ambiguity because S'^ € ^(2*^°) — RAT^^ holds by Theorem \5.6l 

Moreover the question of the decidability of ambiguity for infinitary rational relations 
naturally arises. It can be solved, using another recent result of the first author. 

Proposition 5.9 ( [Fin03e] ) Let X and Y be finite alphabets containing at least two 
letters, then there exists a family T of infinitary rational relations which are subsets of 
X^ X , such that, for R G either R = X^ x Y'^ or R is a Y^-complete subset of 
X^ X Y^ , but one cannot decide which case holds. 

Corollary 5.10 Let k be an integer >2 or k G {^q , ^o}- Then it is undecidable to deter- 
mine whether a given infinitary rational relation is in the class RATi_j{a < k) (respectively 
RAT^{a < k)). 

In particular one cannot decide whether a given infinitary rational relation is non ambigu- 
ous or is inherently ambiguous of degree 2^° . 
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Proof. Consider the family T given by Proposition 15.91 and let R ^ J-. 
li R = X then R is obviously non ambiguous but if ii is a 5]{-complete subset of 
X'^ X then by Theorem 15.61 the infinitary rational relation R is inherently ambiguous 
of degree 2^" . But one cannot decide which case holds and this ends the proof. □ 

Acknowledgements. We thank Dominique Lecomte and Jean-Pierre Ressayre for useful 
discussions and the anonymous referees for useful comments on a preliminary version of 
this paper. 
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